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Method and Apparatus for Delivery of Metadata Synchronized 

to Multimedia Contents 

5 CROSS REFERENCE TO RELATED APPLICATION 

This application is based on Korea Patent Application No. 2001-4341 
filed on January 30, 2001 in the Korean Intellectual Property Office, the 
content of which is incorporated herein by reference. 

10 BACKGROUND OF THE INVENTION 

(a) Field of the Invention 

The present invention relates to an apparatus and method for 
synchronizing metadata with multimedia contents, and transmitting them. 

(b) Description of the Related Art 

15 Metadata description methods for representing Essence, which is 

multimedia contents, and their standardization activities are now in progress. 
However, prior art only disclose metadata description methods and do not 
include synchronization and transmission methods of the multimedia 
contents and related metadata. The specifications of the metadata 

20 description method are found from MPEG, SMPTE, and TV Anytime. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method for 
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synchronizing metadata with multimedia contents, and transmitting them, 
and for a terminal to receive the multimedia contents including the metadata 
and use them. 

In one aspect of the present invention, a metadata transmitter 
5 synchronized with multimedia contents comprises: a multimedia contents 
authoring unit for generating and editing multimedia contents; a multimedia 
contents format converter for compressing the multimedia contents, 
converting them into a transmission format for synchronization and 
transmission, and outputting them; a metadata authoring unit for generating 

10 and editing metadata for describing the multimedia contents, the metadata 
including transmission types and transmission information; a metadata 
format converter for converting the metadata into binary codes, converting 
the converted metadata into a synchronization format for synchronization 
with the multimedia contents and a transmission format for transmission, and 

is outputting them; and a multiplexer for multiplexing the multimedia contents 
format and the metadata format respectively output from the multimedia 
contents format converter and the metadata format converter into a stream, 
and outputting it. 

The metadata format converter comprises: a metadata 
20 synchronization format converter for converting the metadata transmitted 
from the metadata authoring unit into binary codes, and converting them into 
a synchronization format for synchronization with the multimedia contents; 
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and a metadata transmission format converter for converting the data output 
from the metadata synchronization format converter, according to a 
transmission format 

In another aspect of the present invention, a method for 

5 synchronizing metadata with multimedia contents and transmitting them 
comprises: (a) generating and editing metadata for describing multimedia 
contents, the metadata including transmission types and transmission 
information; (b) converting the metadata into binary codes, and converting 
the converted metadata into a synchronization format for synchronization 

10 with the multimedia data; and (c) converting the metadata converted in (b) 
into a transmission format for transmission. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and 
15 constitute a part of the specification, illustrate an embodiment of the 
invention, and, together with the description, serve to explain the principles 
of the invention: 

FIG. 1 shows a metadata transmission system according to a 
preferred embodiment of the present invention; 
20 FIG. 2 shows a metadata format converter according to a preferred 

embodiment of the present invention; 

FIG. 3 shows a flowchart of a method for transmitting metadata 
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synchronized with MPEG-2 data according to a preferred embodiment of the 
present invention; 

FIG. 4 shows definitions of stream identifiers used for transmitting 
the metadata synchronized with MPEG-2 data according to a preferred 
5 embodiment of the present invention; 

FIG. 5 shows definitions of stream-type values used for transmitting 
the metadata synchronized with MPEG-2 data according to a preferred 
embodiment of the present invention; 

FIG. 6 shows an exemplified PES packet for synchronizing 
10 synchronous metadata with MPEG-2 data according to a preferred 
embodiment of the present invention; and 

FIG. 7 shows an exemplified PES packet for synchronizing 
synchronized metadata with MPEG-2 data according to a preferred 
embodiment of the present invention. 

15 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following detailed description, only the preferred embodiment 
of the invention has been shown and described, simply by way of illustration 
of the best mode contemplated by the inventor(s) of carrying out the 
20 invention. As will be realized, the invention is capable of modification in 
various obvious respects, all without departing from the invention. 
Accordingly, the drawings and description are to be regarded as illustrative in 
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nature, and not restrictive. 

FIG. 1 shows a metadata transmission system according to a 
preferred embodiment of the present invention. As shown, the metadata 
transmission system comprises a multimedia contents authoring unit 100; a 
5 multimedia contents format converter 200; a metadata authoring unit 300; a 
metadata format converter 400; and a multiplexer 500. 

The multimedia contents authoring unit 100 generates multimedia 
contents, edits them, and outputs them to the multimedia contents format 
converter 200. In this instance, the multimedia authoring process includes 

10 processes of generating and editing the multimedia data, and the editing 
process does not specify a particular process excepting auxiliary tasks 
including correcting and adding generated multimedia data. 

The multimedia contents format converter 200 compresses the 
multimedia contents input from the multimedia contents authoring unit 1 00, . 

is converts them into transmission format data for synchronization and 
transmission, and outputs them to the multiplexer 500. The multimedia 
contents format converter 200 performs synchronization format conversion 
and transmission format conversion. According to the embodiment of the 
present invention, the synchronization format includes: MPEG-2 PES 

20 (packetized elementary stream) packets, MPEG-4 SL (sync layer) packets, 
MPEG-4 FlexMux packets, and RTP (real time protocol) standard 
specifications, and the transmission format includes: MPEG-2 TS (transport 
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stream), IP (Internet protocol), and ATM (asynchronous transfer mode) 

oiaiiuaiu ojjcoiiiuaiiufio. 

That is, the multimedia contents format converter 200 compresses 
the multimedia contents using at least one method of standard specifications 
5 of MPEG-1 , MPEG-2, MPEG-4, H.261 , H.263, and H.26L In other words, for 
example, it compresses some of the multimedia data using the MPEG-4 
standard and compresses a remaining portion of the multimedia data using 
the H.263 standard, so the whole of the multimedia data may comprise the 
MPEG-4 data and the H.263 data. 

10 After the compression process, the multimedia contents format 

converter 200 converts the compressed multimedia contents into a 
synchronization format using at least one standard specification of the 
MPEG-2 PES packet, the MPEG-4 SL packet, the MPEG-4 FlexMux packet, 
and the RTP packet, and converts them into a transmission format using at 

15 least one standard specification of the MPEG-2 TS, the IP, and the ATM. 

The metadata authoring unit 300 generates and edits metadata for 
describing the multimedia cdntents, and outputs them to the metadata format 
converter 400. According to the embodiment of the present invention, the 
metadata authoring unit 300 performs an authoring process using one of 

20 MPEG-7, SMPTE (Society of Motion Picture and Television Engineers), TV 
Anytime, and EBU (European broadcasting union) standard specifications on 
the XML (extensible markup language). In this instance, the metadata 
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authoring unit 300 concurrently generates transmission types and 
transmission information at the time of authoring. 

The metadata format converter 400 converts the metadata input from 
the multimedia contents authoring unit 100 into binary codes, converts them 
5 into a transmission format for synchronization and transmission, and outputs 
them to the multiplexer 500. The metadata format converter 400 performs 
synchronization format conversion and transmission format conversion. The 
synchronization format includes data characteristics, relations with whole 
streams, time information, and length information of a charged load, and the 

10 transmission format representing a format needed for transmitting packetized 
data includes sequence information and data types of the charged load. 

That is, the metadata format converter 400 converts the metadata 
into binary codes using at least one of the MPEG-7, the SMPTE, the TV- 
Anytime, and the EBU standard specifications, converts the converted 

15 metadata into a synchronization format using at least one of the MPEG-2 
PES packet, the MPEG-4 SL packet, the MPEG-4 FlexMux packet, and the 
RTP packet standard specifications, and converts them into a transmission 
format using at least one of the MPEG-2 TS, the IP, and the ATM standard 
specifications. 

20 The multiplexer 500 multiplexes the multimedia contents input from 

the multimedia contents format converter 200 and the metadata input from 
-the metadata format converter 400 into a single stream, and transmits it to a 
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transmission network 600. 

An interactive terminal 700 processes the stream transmitted via the 
transmission network 600 so that a user may use desired multimedia 
contents and metadata. 
5 FIG. 2 shows the metadata format converter 400 according to a 

preferred embodiment of the present invention. As shown, the metadata 
format converter 400 comprises: a metadata synchronization format 
converter 420; and a metadata transmission format converter 440. 

The metadata synchronization format converter 420 converts the 
10 XML-language metadata transmitted from the metadata authoring unit 300 
into binary codes, and converts them into a synchronization format. The 
metadata transmission format converter 440 converts the data transmitted 
from the metadata synchronization format converter 420 into predetermined 
data according to respective transmission formats, and outputs them to the 
15 multiplexer 500. 

In this instance, the subsequent two methods can be used to 
synchronize the metadata according to the preferred embodiment of the 
present invention. 

The first method is to packetize the metadata into packets identical 
20 with those for transmitting speech and image data. In detail, the metadata 
are packetized in the sequential order of the RTP packet and the IP packet in 
the internet network case, they are packetized into TS packets after PES or 
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section packetizing in the broadcasting network case, and they are 
sequentially packetized in the order of the SL packet and the FlexMux packet 
in the MPEG-4 case. In this instance, since the RTP packet, the PES packet, 
or the SL packet for packetizing the metadata has decoding time or output 

5 time value, it is required to packetize the metadata according to the time 
value. However, the first method is required to support each network's 
decoder model according to categories of transmitting networks. That is, 
since it is needed for the decoder to analyze the packets that have 
respective networks' time values and to connect to a decoder for decoding 

10 the metadata, it is impossible to amend to each system decoder model. 

The second method is to convert the metadata into a 
synchronization format and synchronize it with multimedia data. This method 
enables to synchronize data and transmit them with no relation to the 
transmission networks. In this instance, it is necessary for the decoder model 

is to use the decoder model of the metadata without using that of each 
transmission network. Also, since the metadata synchronization format has 
independent decoding time and output time values, it enables to operate the 
decoder model and support synchronization. In this instance, the decoding 
time value and the output time value refer to the metadata's time default 

20 value and time reference value to represent the metadata's decoding time 
and output time. 

The metadata synchronization format converter 420 comprises: a 
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metadata binary code converter 421; a metadata synchronous format 
converter 422; a packetizing controller 423; an RTP packetizer 424; an 
MPEG-2 packetizing controller 425; a PSI (program specific information) 
generator 426; a PES packetizer 427; a section packetizer 428; and an SL 

5 packetizer 429. 

The metadata binary code converter 421 converts the metadata 
stored in the XML language into binary codes so as to transmit the metadata 
generated from the metadata authoring unit 300. The metadata synchronous 
format converter 422 converts the binary codes into a metadata 

10 synchronization format so as to synchronize and transmit them with no 
relation to the transmission networks. In this instance, the metadata 
synchronization format independently has decoding time and output time 
values so as to operate the decoder model and support synchronization. 
Also, the decoding time value and the output time' value refer to the 

15 metadata's time default value and time reference value to represent the 
metadata's decoding time and output time. 

The packetizing controller 423 selects a metadata's transmission 
network so as to make the transmission network of the multimedia contents 
coincide with that of the metadata. 

20 The RTP packetizer 424 packetizes the metadata into an RTP, and 

the SL packetizer 429 packetizes synchronous, synchronized, and 
asynchronous metadata into an MPEG-4 SL packet. 

10 
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In this instance, the technical terms "synchronous" and 
"synchronize" are generally used in data broadcasting. To synchronize is to 
match an image frame with an audio frame at a time axis so as to regulate 
syncs of images and speech, or to match an axis of additional data with a 
5 time axis that matches the speech with the images. To be synchronous is to 
match additional data with another independently-operating time axis that 
does not correspond to the time axis for synchronizing the speech or images. 

The MPEG-2 packetizing controller 425 classifies the metadata input 
to be packetized into an MPEG-2 system specification, as the metadata that 

10 have synchronization time values and other metadata that do not have them, 
outputs the metadata that have synchronization time values to the PES 
packetizer 427, outputs the metadata that do not have synchronization time 
values to the section packetizer 428, and transmits PSI information including 
metadata transmission types and transmission information to the PSI 

is generator 426. 

In this instance, the PSI for representing information defined for a 
decoder to decode programs includes: a PAT (program association table); a 
PMT (program map table); an NIT (network information table); and a CAT 
(conditional access table). The PAT and the PMT represent information on 

20 program elements that form a program, the NIT shows information on the 
transmission networks, the CAT indicates information on conditional 
receiving, and the PES represents a data structure used for carrying 
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elementary streams. 

Since the metadata are transmitted in the PES or sections, the PSI 
generator 426 receives a transmission type and transmission information and 
records them on the PMT section so as to provide related terminal 
5 information. 

The PES packetizer 427 packetizes the synchronous and 
synchronized metadata into an MPEG-2 PES. Since a PES packet header 
includes a DTS (decoding time stamp) and a PTS (presentation time stamp), 
synchronization is possible based on them. 
10 The section packetizer 428 packetizes asynchronous metadata into 

sections. Since a section header does not include synchronous and 
synchronized time values, it is used for transmitting asynchronous metadata. 

The metadata transmission format converter 440 comprises: an IP 
packetizer 441; a TS packetizer 442; and a FlexMux packetizer 443. The IP 
is packetizer 441 packetizes the metadata into an IP, and the FlexMux 
packetizer 443 packetizes the metadata into a FlexMux. 

In this instance, the FlexMux represents a multiplexing method of 
options provided by the MPEG-4 system. That is, the FlexMux packet is used 
for reducing an overhead of a transmission multiplexer (TranMux) or 
20 allocating a channel of the transmission multiplexer when multiplexing a 
plurality of streams. In general, the MPEG-4 stream is to be packetized into 
an SL packet in a sync layer, but the overhead can be reduced by 
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packetizing one or a plurality of SL packets into a single FlexMux packet. 
Also, each MPEG-4 stream together with a logical channel is to be 
transmitted to a terminal from a server, and the FlexMux packet allocates 
logical channels for the respective MPEG-4 streams. 
5 The TS packetizer 442 packetizes a PMT table input from the PSI 

generator 426, metadata input from the PES packetizer 427, and metadata 
input from the section packetizer 428 into transport streams (TS). 

A method for using MPEG-2 data as multimedia contents, 
synchronizing the metadata with the multimedia contents, and transmitting 
10 them in a digital broadcasting will now be described. 

FIG. 3 shows a flowchart of a method for synchronizing the metadata 
with the MPEG-2 data and transmitting them according to a preferred 
embodiment of the present invention. 

When metadata are input from the metadata authoring unit 300 and 
is the metadata binary code converter 421 in step S9, the input metadata are 
analyzed in step S10. In this instance, it is determined whether they need to 
be synchronized with the MPEG-2 data in step S1 1 , and when needed, they 
are packetized into PES packets in step S1 2, and when not needed, they are 
packetized into private sections in step S13. Also, the metadata are analyzed 
20 to generate PSI in step S14, and the generated PSI, the PES, or the data 
packetized into private sections are packetized into TS packets in step S15. 
The TS-packetized metadata are multiplexed with MPEG-2 audio/video TS 

13 

BNSDOCID: <WO_Q2061596A1_L> 



WO 02/061596 



PCT/KR02/00137 



through an input of a synchronization initial value to be output as a single TS 
in step S16. In a detailed method for synchronizing the metadata with the 
MPEG-2 data, a metadata time default value and a metadata time reference 
value are defined and used so as to synchronize the metadata with a system 
5 time reference value, that is, an STC (system time clock), and a program 
time reference value, that is, a PCR (program clock reference) defined by the 
MPEG-2 system standard. 

Since the STC defined by the MPEG-2 system standard is an STC 
operating at 27MHz, the STC is to be cooperated with the metadata time 
10 default value as a basic condition for synchronizing the metadata with the 
MPEG-2 data, which is expressed in Equation 1 . 

Equation 1 

/S7u( f )/ fMe la da U ,Ti, ne Base( t )= + foteger 

where f STC (t) represents a system clock signal of 27MHz, and 
15 fMetadataTimeBase(t) indicates a metadata time default value. 

Further, since the PCR defined by the MPEG-2 system standard is a 
PCR sampled by 90KHz, the metadata time reference value is divided by the 
integer of 90KHz so as to synchronize the metadata with the PCR, which is 
expressed in Equation 2. 
20 Equation 2 

(/src(0/300)/ Weft ,^ oaodkRe/erCTce = ^Integer 

where (fsrc(t)/300) represents 90KHz, and 1 MtHi ^ maM t KinBa indicates 
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a metadata time reference value. 

For further detailed description of Equations 1 and 2, in the MPEG-2 
system, the system clock signals are to be timed at 27MHz so as to match 
the operation of the encoder and the decoder. For this purpose, the 
5 operation of the encoder at 27MHz is to be provided to the decoder, which is 
enabled by transmitting the PCR that are values sampled at 90 KHz. The 
reason for transmitting the values sampled at 27MHz/300=90KHz is to 
maintain the compatibility between the MPEG-1 and the MPEG-2, since the 
MPEG-1 operates at 90KHz. In this instance, Equation 1 represents that 

10 since the system clock of multimedia data operates at 27MHz, the clock of 
the metadata is to operate at a clock signal divided by an integer 
corresponding to this, thereby enabling synchronization between them. In the 
like manner, Equation 2 shows that a metadata time reference value is to 
have a time reference value, with respect to the multimedia data transmitting 

15 a time reference value sampled by 90KHz, as many as the number obtained 
by dividing 90KHz by an integer so as thus to enable synchronization 
between them. 

In the preferred embodiment of the present invention, in order to 
synchronize the metadata that require synchronization with the MPEG-2 data 
20 and transmit them, the metadata are packetized into access units using the 
MPEG-2 system standard. That is, to synchronize the metadata with the 
-MPEG-2 data, the metadata are packetized into packets using the PES 
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packets as defined by the MPEG-2 system standard, and they are 
transmitted using the TS. In order to packetize the metadata into PES 
packets through a detailed implementation method for synchronizing the 
metadata with the MPEG-2 data, a stream identifier (streamjd) of a PES 
5 packet header defined by the MPEG-2 system standard is extended as 
follows. 

The stream identifier (streamjd) of the PES packet for transmitting 
the metadata is a field that represents what category of data the charged 
load of the PES packet is. Stream identifier values for the metadata are not 
10 defined in the current international standard, but the present embodiment 
defines a streamjd for the metadata and uses it, and accordingly, the 
metadata may be carried on the charged load of the PES packet to be 
transmitted, which can be expressed as follows. 
PES_packet( ) { 
15 Packet_start_code_prefix 

Streamjd = Metadata stream 

PES_packet_length 

} 

In this instance, a value OxFC is allocated as a stream identifier for a 
20 newly defined metadata stream as shown in FIG. 4. 

Also, in the preferred embodiment of the present invention, in order 
to transmit the metadata that do not require synchronization, the metadata 
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are packetized using the MPEG-2 system standard. In order to transmit the 
metadata without synchronizing them, the metadata are packetized using the 
private sections, and they are transmitted using the TS as defined by the 
MPEG-2 system standard. 
5 In the preferred embodiment of the present invention, in order to 

transmit the metadata and apply them to a terminal, a message that the 
metadata are transmitted is reported to the terminal by using the MPEG-2 
system standard. That is, in order to report a metadata transmission notice to 
the terminal using the MPEG-2 system, a stream type of a PMT table header 
10 defined by the MPEG-2 system standard is extended as follows. 
TS_program_map_section ( ) { 
table_id 

section_syntax_indicator 
'0' 

15 

// Video 

stream_type = 0x03 (ISO/IEC 13818-2 

Video) 

reserved 

20 elementaryJPID 

// Audio 

streamjtype = 0x04 (ISO/IEC 13818-3 
17 
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Audio) 

reserved 

e 1 eme nt ar y_P I D 

5 

// Metadata 

stream_type = 0x15 (Metadata stream 
carried in PES packets ) 

reserved 

10 elementary_PlD 

stream_type = 0x16 (Metadata stream 
carried in Private Section) 

reserved 
elementary__PID 

15 

} 

CRC_32 

} 

As described above, the PMT represents information on the element 
20 bit streams configuring a program, defines identifiers of respective element 
bit streams, and adds descriptors to show information on detailed element bit 
streams. However, since the current standard does not have stream_type 
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values for the metadata in the PMT table in a similar manner as definition of 
strearnjd, a stream_type value is to be set so as to configure the metadata 
into data related to a single program. Hence, the present invention defines 
and uses the stream_type value to synchronize the multimedia data with the 

5 metadata and transmit them. 

As shown in FIG. 5, the stream-type values for the newly defined 
metadata stream have a value of 0x15 in the case of the metadata 
transmitted to the PES packet, and a value of 0x16 in the case of the 
metadata transmitted to the private section. 

10 Finally, in the preferred embodiment of the present invention, in 

order to synchronize the metadata that require synchronization with the 
MPEG-2 data and transmit them, a CTS (composition time stamp)/DTS 
(decoding time stamp) time value of a metadata access unit is used as an 
input of a PTS (presentation time stamp)/DTS time value when packetizing 

15 the metadata into PES packets. 

The metadata for being synchronized with the MPEG-2 data are 
classified in two ways. The first is as synchronous metadata, and the second 
is as synchronized metadata. Since the synchronous metadata stream is 
organically operated, the synchronous metadata can be synchronized with 

20 the multimedia contents by adding a synchronization initial value (Offset) to 
each CTS time value of the metadata stream to generate a PTS value, which 
is expressed in Equation 3. 
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Equation 3 

PTS(t) - CTS(t)+" Offset" 

FIG. 6 shows a PES packet format for synchronizing the metadata 
with the MPEG-2 data. 
5 Since the synchronized metadata is not organically operated, the 

synchronized metadata can be synchronized with the MPEG-2 data by 
inputting each CTS time value of the metadata stream through a value 
identical with that of a PTS time value, which can be expressed as in 
Equation 4. 
10 Equation 4 

PTS(t) = CTS(t) 

FIG. 7 shows a PES packet format for synchronizing the 
synchronized metadata with the MPEG-2 data. Through the above process, 
the synchronous and synchronized metadata can be synchronized with the 
is multimedia contents, and they are packetized into 188-byte TS packets and 
multiplexed with input MPEG-2 audio/video TS so as to transmit them. 

According to the present invention, a detailed implementation 
method for synchronizing the metadata used as additional information in the 
digital broadcasting with the MPEG-2 data and transmitting them is provided, 
20 thereby enabling transmitting the metadata in real-time, enabling the user's 
random access, and applying the two kinds of data in various ways. 

While this invention has been described in connection with what is 

20 
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presently considered to be the most practical and preferred embodiment, it is 
to be understood that the invention is not limited to the disclosed 
embodiments, but, on the contrary, is intended to cover various modifications 
and equivalent arrangements included within the spirit and scope of the 
5 appended claims. 
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WHAT IS CLAIMED IS: 

1 . A metadata transmitter synchronized with multimedia contents 
comprising: 

a multimedia contents authoring unit for generating and editing 
5 multimedia contents; 

a multimedia contents format converter for compressing the 
multimedia contents, converting them into a transmission format for 
synchronization and transmission, and outputting them; 

a metadata authoring unit for generating and editing metadata for 
10 describing the multimedia contents, the metadata including transmission 
types and transmission information; 

a metadata format converter for converting the metadata into 
binary codes, converting the converted metadata into a synchronization 
format for synchronization with the multimedia contents and a transmission 
15 format for transmission, and outputting them; and 

a multiplexer for multiplexing the multimedia contents format and 
the metadata format respectively output from the multimedia contents format 
converter and the metadata format converter into a stream, and outputting it. 

2. The transmitter of claim 1, wherein the metadata format 
20 converter comprises: 

a metadata synchronization format converter for converting the 
metadata transmitted from the metadata authoring unit into binary codes, 
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and converting them into a synchronization format for synchronization with 
the multimedia contents; and 

a metadata transmission format converter for converting the data 
output from the metadata synchronization format converter, according to a 
5 transmission format. 

3. The transmitter of claim 2, wherein the synchronization format 
independently includes a decoding time value and an output time value. 

4. The transmitter of claim 3, wherein the decoding time value and 
the output time value are established by referring to a time default value and 

10 a time reference value of the metadata. 

5. The transmitter of claim 2, wherein the metadata 
synchronization format converter comprises: 

a metadata binary code converter for converting the metadata 
generated by the metadata authoring unit into binary codes; 

15 a metadata synchronous format converter for converting the 

converted binary codes into a metadata synchronous format including a 
metadata time default value and a metadata time reference value so as to 
synchronize the converted binary codes and transmit them with no relation to 
transmission networks; 

20 an MPEG-2 packetizing controller for controlling to classify the 

metadata output by the metadata synchronous format converter as metadata 
that have a synchronized time value and metadata that do not have a 
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synchronized time value, packetize the metadata that have a synchronized 
"time vaiue into PES (packetized elementary stream) packets, packetize the 
metadata that do not have a synchronized time value into sections, and 
generate PSI (program specific information) including metadata transmission 
.. 5 types and transmission information; 

a PSI generator for writing the PSI output by the MPEG-2 
packetizing controller in a PMT (program map table) section; 

a PES packetizer for packetizing the metadata that require 
. synchronization and are output from the MPEG-2 packetizing controller into 
10 PES packets; and 

a section packetizer for packetizing the metadata that do not 
require synchronization and are output from the MPEG-2 packetizing 
controller into sections. 

6. The transmitter of claim 5, wherein the metadata 
15 synchronization format converter further comprises: 

an RTP (real time protocol) packetizer for packetizing the 
metadata output from the metadata synchronous format converter into an 
RTP; 

an SL (sync layer) packetizer for packetizing synchronous 
20 metadata, synchronized metadata, and asynchronous metadata output from 
the metadata synchronous format converter into MPEG-4 SLs; and 

a packetizing controller for selecting one of the RTP packetizer, 
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the MPEG-2 packetizing controller, and the SL packetizer, and transmitting 
the metadata output from the metadata synchronous format converter so as 
to match a transmission network of the multimedia contents with that of the 
metadata. 

5 7. The transmitter of claim 5, wherein the time default value used 

for the metadata synchronous format is obtained by dividing a time reference 
value that is an STC (system time clock) defined by the MPEG-2 system 
standard by an integer, and the metadata time reference value used for the 
metadata synchronous format is obtained by dividing a program time 
10 reference value that is a PCR (program clock reference) by an integer. 

8. The transmitter of claim 5, wherein the PES packetizer extends 
a stream identifier of a PES packet header defined by the MPEG-2 system 
standard to packetize the metadata that require synchronization into PES 
packets. 

15 9. The transmitter of claim 8, wherein the metadata that require 

synchronization are synchronous metadata, and a PTS (presentation time 
stamp) used for a format of the PES packet is a value obtained by adding an 
offset value to a CTS (composition time stamp) of a metadata access unit. 

1 0. The transmitter of claim 8, wherein the metadata that require 

20 synchronization are synchronization metadata, and a PTS (presentation time 
stamp) used for a format of the PES packet is matched with a CTS 
(composition time stamp) of a metadata access unit. 
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11. The transmitter of claim 5, wherein the PSI generator extends 
a stream type of the PMT table header defined by the MPEG-2 system 
standard so as to notify a terminal of a metadata transmission notice. 

12. A method for synchronizing metadata with multimedia contents 
5 and transmitting them comprising: 

(a) generating and editing metadata for describing multimedia 
contents, the metadata including transmission types and transmission 
information; 

(b) converting the metadata into binary codes, and converting the 
10 converted metadata into a synchronization format for synchronization with 

the multimedia data; and 

(c) converting the metadata converted in (b) into a transmission 
format for transmission. 

13. The method of claim 12, further comprising: (d) multiplexing a 
15 multimedia contents format and the metadata format output in (c) into a 

stream. 

14. The method of claim 12, wherein the synchronization format 
independently includes a decoding time value and an output time value. 

15. The method of claim 14, wherein the decoding time value and 
20 the output time value are established referring to a time default value and a 

time reference value of the metadata. 

16. The method of claim 12, wherein (b) comprises: 
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converting the metadata generated in (a) into binary codes; 

converting the binary codes into a metadata synchronous format 
including a metadata time default value and a metadata time reference value 
so as to synchronize and transmit the binary codes with no relation to 
5 transmission networks; 

controlling to classify the metadata output from the metadata 
synchronous format converter into metadata that have a synchronized time 
value and metadata that do not have a synchronized time value, and 
generate PSI (program specific information) including metadata transmission 
10 types and transmission information; 

writing the PSI in a PMT (program map table) section; 

packetizing the metadata into PES packets when the metadata 
require synchronization; and 

packetizing the metadata into sections when the metadata do not 
15 require synchronization. 

17. A metadata transmitter synchronized with multimedia contents 
comprising: 

a metadata authoring unit for generating editing metadata for 
describing the multimedia contents, the metadata including transmission 
20 types and transmission information; 

a metadata synchronization format converter for converting the 
metadata transmitted by the metadata authoring unit into binary codes, and 
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converting them into a synchronization format for synchronization with the 
multimedia contents; and 

a metadata transmission format converter for converting data 
output from the metadata synchronization format converter according to a 
5 transmission format. 

18. The transmitter of claim 17, further comprising a multiplexer for 
multiplexing the multimedia contents format and a metadata format output 
from the metadata transmission format converter into a stream, and 
outputting it. 

io 1 9. The transmitter of claim 1 7, wherein the synchronization format 

independently includes a decoding time value and an output time value, and 
the decoding time value and the output time value are established referring 
to a time default value and a time reference value of the metadata. 

20. The transmitter of claim 17, wherein the metadata 
is synchronization format converter comprises: 

a metadata binary code converter for converting the metadata 
generated by the metadata authoring unit into binary codes; 

a metadata synchronous format converter for converting the 
converted binary codes into a metadata synchronous format including a 
20 metadata time default value and a metadata time reference value so as to 
synchronize the converted binary codes and transmit them with no relation to 
transmission networks; 
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an MPEG-2 packetizing controller for controlling to classify the 
metadata output by the metadata synchronous format converter as metadata 
that have a synchronized time value and metadata that do not have a 
synchronized time value, packetize the metadata that have a synchronized 
5 time value into PES (packetized elementary stream) packets, packetize the 
metadata that do not have a synchronized time value into sections, and 
generate PSI (program specific information) including metadata transmission 
types and transmission information; 

a PSI generator for writing the PSI output by the MPEG-2 
10 packetizing controller in a PMT (program map table) section; 

a PES packetizer for packetizing the metadata that require 
synchronization and are output from the MPEG-2 packetizing controller into 
PES packets; and 

a section packetizer for packetizing the metadata that do not require 
is synchronization and are output from the MPEG-2 packetizing controller into 
sections. 



20 
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NOTES 

1 PES packets of type program_stream_map have unique syntax specified in 2.5 .4. 1 . 

2 PES packets of type private_stream 1 and ISO/IEC 13552 stream follow the same PES packet 
syntax as those for ITU-T Rec. H.262 ISO/IEC 13818-2 video and ISO/IEC 13818-3 audio streams. 

3 PES packets of type private_stream_2, ECM_s1ream and EMM stream are similar to 
private_stream_l except no syntax is specified after PES_packet_length field 

4 PES packets of type program_sheam_directory have a unique syntax specified in 2.5.5. 

5 PES packets of type DSM-CC_stream have a unique syntax specified in ISO/IEC 1381 8- 6. 

6 This stream_id is associated with stream_type 0x09 in Table 2-29 . 

7 This stream_id is only used in PES packets, which carry data from a Program Stream or an ISO/IEC 
1 1 1 72- 1 System Stream, in a Transport Stream ("refer to 2 .4.3.7). 
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